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Abstract 

Random permutations with distribution conditionally uniform given the set of record 
values can be generated in a unified way, coherently for all values of n. Our central 
example is a two-parameter family of random permutations that are conditionally 
uniform given the counts of upper and lower records. This family interpolates 
between two versions of Ewens' distribution. We discuss characterisations of the 
conditionally uniform permutations, their asymptotic properties, constructions and 
relations to random partitions. 

1 Introduction 

Random permutations 7r„ G 6„ with distribution conditionally uniform given the value of 
some statistic stat offer a wide and most natural generalisation of the uniform distribution 
on the symmetric group It is sometimes possible to define random permutations 
coherently for all values of n, in a way connecting the asymptotic properties of 7r„'s with 
a de Finetti-type representation which generates (vr^) from some limiting form of stat by 
means of a standard sampling procedure. 

The most studied instance of coherent permutations is the one with stat defined as 
the nondecreasing sequence of cycle-sizes of 7r„, see [H [23]. In this case the sequence 
(stat(7r„), n = 1,2,...) is Kingman's partition structure. The scaled cycle-sizes con- 
verge to a nonincreasing array of frequencies (pk), from which (vr^) can be recovered by 
a stochastic algorithm known as Kingman's paintbox process. A distinguished example 
of coherent permutations with the cycle statistic is the parametric family of Ewens' dis- 
tributions for (7r„), associated with the Poisson-Dirichlet law for the frequencies. Ewens' 
distributions, as well as more general two-parameter distributions due to Pitman, can be 
generated by a simple urn scheme which does not exploit the asymptotic frequencies [23] . 
In fact, for Ewens' family the minimal sufficient statistic is just the number of cycles of 
7r„ and, moreover, every distribution for (7r„) with this property is a unique mixture of 
Ewens' distributions, see [TOl Theorem 12 (i)]. See [lOl [TTl [23] and references therein for 
many examples of partition structures. 

Adopting for stat the sequence of cycle-sizes arranged by increase of minimal elements 
of the cycles leads to a wider type of structure introduced by Pitman f22j in the form of 
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partially exchangeable partitions. The blocks of these ordered partitions correspond to 
the cycles of permutation, with the circular arrangement of elements within the cycles 
ignored. In this case the limiting shape of stat is an arbitrary random array (pk) of 
positive frequencies adding to at most unity. For instance, for Ewens' permutations this 
sequence of frequencies has the GEM distribution, which is a size-biased arrangement of 
the Poisson-Dirichlet law. 



Gnedin and Olshanski |8j studied coherent permutations with the set of descent posi- 
tions of TTn in the role of stat. They showed that coherent sequence of permutations (7r„) 
corresponds to a spreadable random order on N (spreadability, also called contractability, 
means invariance under all increasing injections N ^ N, see [15]). The limiting shape 
of stat was identified with two disjoint open subsets of [0,1]. It was further shown in 
[9] that if the law of each 7r„ is uniform conditionally given the number of descents, then 
(TTn) is a unique mixture of a-shufHes (introduced in [3j) and reversed a-shuffles. In this 
sense the a-shuffles in the setting with descent statistic can be regarded as analogues of 
Ewens' distributions in the setting with cycle statistic. See [Bl Section 8.6] for results in 
the setting where stat is the peak set of permutation. 

Kerov and Tsilevich [TTl HH] studied coherent permutations with stat defined to be 
the set of upper records of 7r„. This structure can be reduced to Pitman's [22] partially ex- 
changeable partitions by the virtue of a fundamental bijection ©„ —>■ which translates 
the record statistics into the cycle statistics pSJ p. 17]. In the interpretation in terms 
of records, the role of limiting shape of stat is played by partial sums + • • 

which are also the upper record values of a random sequence such that the 7r„'s can 

be generated by ranking the variables See [6],[7| for more on partially exchangeable 

partitions and an application to multivariate records. 

In this paper stat is the two-sided set of records of vr^, both upper and lower. We 
extend known results [121 ESI EZl E2] to include both types of records in a symmetric 
way. In particular, Ewens' family of random permutations will be extended to a two- 
parameter family of distributions P^^''^\ Generalising the above mentioned one-sided 
result [101 Theorem 12 (i)] we show that every coherent (7r„) with 7r„ conditionally uniform 
given the counts of upper and lower records is a mixture of the P'^^'''^'s. Permutations 
under p(^'^) can be generated by ranking a sequence of real-valued random variables 
(X„), whose records follow a two-sided analogue of the GEM distribution. This kind of 
representation is also shown for arbitrary coherent sequence of permutations (7r„), with 
each TTn uniformly distributed given its set of record values. Explicit formulas are possible 
for a multiparametric class of distributions for (7r„), which may be regarded as a two-sided 
generalisation of a well-known Pitman's construction of exchangeable partitions [22] . 

2 Counting the records 

Permutations 7r„ G ©„ of [n] := {1, . . . ,n} will be written in the one-row notation as 
Ti'n = (TTni, • • • , '^nn) ■ Wc Call element Tinj a lower record of 7r„ if Unj = min(7r„i, . . . , 7r„j), 
and we call 7r„j an upper record if vr^j = max(7r„i, . . . , 7r„j). When record we say 

that record value and that j is a record time (or a record position). The first 

entry vr^i will be called center. We regard the center as improper lower and upper record. 
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all other records being proper. We denote 



rec 7r.„ 



{r-e, 



,r-i,ro,ri, 



the two-sided increasing sequence of record values, with distinguished center tq = vr„i, 
proper lower records r_£, . . . ,r_i and proper upper records ri, . . . ,r„. In this notation 
£,u count the proper records; for instance, rec(3, 2, 7, 6, 1, 4, 8, 5) = (1,2,3,7,8), where 
the center is boldfaced and i = u = 2. Clearly, r_£ = 1, r„ = n, and the total number 
of records #rec(7r„) = i + u + 1 satisfies min(2,n) <i + u + l<n. The record times 
of proper lower and upper records will be labelled ti, . . . , t„ and t_i, . . . , t_i, respectively, 
and we denote = 1 the record time associated with the improper record. 



Let 



n 



be the number of permutations 7r„ G ©„ with i + 1 lower and u + 1 upper 

and M, and satisfies the 



records. This array of combinatorial numbers is symmetric in 
recursion 



n 




n — 1 




n — 1 


+ (n - 2) 


n — 1 


/+ + 1 




/, u + 1 


+ 


£ + l,u 


/+ 1 





Summing over one of the parameters, say u, yields a signless Stirling number of the first 
kind 



n 

+ 1 



n-l r 

u=0 



n 

+ 1,M 



equal to the number of permutations with £+1 lower records. A more delicate connection 
to the Stirling numbers appears via the identity 

iH /I) I „,\ 

(2) 

found in [2| p. 179], where it was derived by manipulation with generating functions. 

For our purposes it is important to introduce yet another encoding of permutation 
into the sequence of initial ranks 



n 




n — 1 




/ + 1, u + 1 




i + u 





#{A; : k<j, n^k > vr„j}, j G [n]. 



The correspondence 7r„ 



is a well-known bijection between 6„ and [1] x [2] x 
1, and an upper record if ij = j. 
In terms of the initial ranks a bijective proof of ([2]) is easily acquired. To this end, 
consider the mapping which sends vr^ G &n to 7r^_i G &n~i so that the initial ranks are 



X [n]. Note that lower record if in 



transformed as (ii, 



"1' 



"n-l. 



where i'j_;^ 



ijl{ij < j) for 2 < j < n. Each 



proper record of 7r„ is mapped bijectively to a lower record of vr^_i, and the record counts 
satisfy £(7r„) +'u(7r„) = £(vr^) + 1. It is easily seen that 2'' permutations 7r„ are mapped to 
the same vr^.^ each time when •^(vr^_i) + 1 = r, and of these 7r„ there are (^) permutations 
with i proper lower records. Because 7r^_i with r lower records can be chosen in ["~^] 
ways, the identity ([2]) follows. 

When a probability distribution P„ is specified on (3„, we consider 7r,fi clS db random 
variable. In particular, Pi^'^''(7r„) = 1/n! is the uniform distribution (indices will be 
explained in the next section). The characteristic feature of the uniform distribution is 
that the initial ranks are independent, with each ij being uniformly distributed on [j]. 
Giving a probabilistic interpretation to ([2]) we have: 
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Lemma 1. Under the uniform distribution Pn ' for Tin, conditionally given the record 
counts {£, u) and given the positions occupied by £ + u proper records, all (^"^"j allocations 
of i lower records within these i + u positions are equally likely. 



3 A two- parameter family of random permutations 

We introduce next a two-parameter deformation of the uniform distribution, for which 
(£, u) is a sufficient statistic, meaning that given the record counts the distribution of 7r„ 
is uniform. 



Proposition 2. For arbitrary positive 9 and ( the formula 



(3) 



defines a distribution on 6„, which assigns the same probability to every permutation with 
£ + 1 lower and u + 1 upper records. 

(f) t^ 

Proving this amounts to alternative definition of Pn as the probabihty distribution 
under which the initial ranks are independent and satisfy ii = 1 and for j > 1 



1 w.p. 9/{9 + C + j-2), 
J w.p. C/(0 + C+j-2), 
r w.p. l/(6' + C + j-2) for r = 2, 



,3 



(w.p.=with probability). Multiplying these out it is seen that ([3]) is the probability of any 
sequence (^2, • • • , in) where ij = 1 occurs i times and ij = j occurs u times. Thus, Pn'''^ 
is obtained from Pi^'^^ by tilting the probabilities of extreme values of the initial ranks. 

The fact that the probabilities in ([3]) add to unity is also equivalent to the formula for 
the bivariate generating function 



E 



n 

+ 1 



+ C)n- 



l! 



(4) 



which dates back to at least [Ij. For C = 1 this specialises as the well-known formula 



n-l r 

E 

£=0 



n 

+ 1 



+ l)n- 



, Xn a sequence 



for the generating function of Stirling numbers. 

Recall that ranking associates with any sequence of distinct reals Xi, 
of ranks Unj = < n : Xi < Xj}, also called the ranking permutation. Ranking for the 
sequences with repetitions will be introduced in Section [91 

Integer parameters. For integer 9, ( the distribution Pn can be obtained as a pro- 
jection of the uniform distribution P^+d on &n+d, where d = 9 + ( — 2. To ease notation, 
for the rest of this section the elements of permutation are written with one index. 
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Fix {wi, . . . ,Wn+d) £ &n+d- A sequence (tt^, j G [n]) (which is a permutation of 
n integers {6, . . . ,n + 6 — 1}) is uniquely defined by the condition that {tt^, .... n'j} C 
{wi, . . . ,Wd+j} is the subset of integers whose ranks among {wi, . . . ,Wd+j} are neither 
among top C ~ 1 ranks nor among bottom 9 — 1 ranks. Here is the inductive definition. Let 
Si, . . . , Sn+d be the initial ranks oi Wi, . . . , Wn+d- At step 1 we define Tr[ to be the element 
of rank 9 among Wi, . . . , Wd+i, thus leaving ( — 1 elements ranked above and 9 — 1 ranked 
below n'l. At step j the element Wd+j is added, if < Sd+j ^ j + 9 — 1 then vrj = Wd+j, if 
1 < Sd+j < 9 — 1 then Hj is defined to be the element of rank 9 among wi, . . . , Wd+j, and if 
j+9 < Sd+j < j+d then n'j is defined to be the element of rank j+9—1 among wi, . . . , Wd+j. 
Understanding the second arrow in {wi, . . . , Wn+d) ^ i'^i, ■ ■ ■ , T^n) ^ {'^i, ■ ■ ■ ■, '^n) as the 
ranking operation, we have defined a projection fn''^^ from &n+d to ©„. 

Proposition 3. For positive integers 9, ( the mapping fn''^^ sends the uniform distri- 
bution on &n+d ( where d = 9 + ( — 2) to Pn''^^ . 

Proof. In the above, the initial ranks for (tti, . . . , 7r„) and (tt^, . . . , tt^) are the same, and 
are given for j = 2, . . . , n by 

f 1, if sj+d e [1,^], 

ij = <^ sj+d -9 + 1, if sj+d e[9 + l,j + 9- 2], 
[ j, if sj+d e \j + 9-1, j + d]. 

For uniform permutation, Sj+d is uniform on [j + d] and these are independent, hence the 
Tj's are independent with respective probabilities 9 / {n + d — 2), / {n + d — 2) for extreme 
ranks and equal probabilities for other values of ij. □ 

For irrational 9 or ( the distribution Pn''^^ cannot be obtained as a projection of a 
uniform distribution on some combinatorial object. 



4 Coherent permutations 

Our view of permutation is biased towards the interpretation as order, rather than map- 
ping. Orders can be obviously restricted from larger sets to smaller. In this direction, we 
say that permutations 7r„ and tt^, for m < n, are coherent if they determine the same 
order on [m] . A sequence (tTr) of coherent permutations 7r„ G 6„ defines a strict order < 
on the infinite set N: j <\i iff Tinj < T^ni for all ^ inax(j, i). 

Let Dnm '■ ~^ &m > f^) be the projection which cuts the last n — m entries of 7r„ 
and replaces the first m entries iini, . . . , rCnm by their ranking permutation. The projection 
Dnm is the same as restricting orders from [n] to [m], hence the coherence means that 
DnmiT^n) — T^m- The space of all orders on N has the structure of the projective limit 
6°° := hm <&n- This space 6°° should not be confused with the infinite symmetric group 

©oo (of bijections N — > N that displace only finitely many integers), which is the inductive 
limit of finite symmetric groups &oo '■= lim6„. 

In terms of the initial ranks, Dnm '■ (^i, • • • ^ {h, ■ ■ ■ jim) is just the projection 
on the first m coordinates. Every infinite sequence (in) determines an order < on N, in 
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which n is ranked z„th within the set [n] . Therefore ©°° can be identified with the infinite 
product space [1] x [2] x . . . Endowed with the product topology, is a metrisable 
totally disconnected Borel space. When a probability measure is defined on we view 
(7r„) e &°° as a random coherent sequence of permutations, or a random order on M. By 
the measure extension theorem, distributions P„ on defined for every n, determine a 
unique distribution P on (3°° for a coherent sequence of permutations if and only if the 
P„'s are compatible with projections. 

We denote P^^'^^ the measure on under which the initial ranks ^1,22, • • • are in- 
dependent, with distribution as in Section [31 The distributions {Pn'^\ n = 1,2,...) 
introduced in Proposition [3] are coherent projections of P^^''^\ 

For an order < on N we shall say that an upper (or lower) record occurs at time n if 
in = n (respectively, i„ = 1). Reversing the order is an automorphism of (5°°, which is 
written as either vr^j 1— >■ n — vr^j for j G [n], n G N, or, via the initial ranks, 
for n G N. Clearly, reversing the order swaps the types of records, hence maps p(^'^) to 



Remark. Except Dn := -Dn,n-i there are two other useful projections D'„,D" : 6„ 
&n-i- Projection D'^ deletes n in the one-row notation of 7r„, and D'^ deletes n in the cycle 
notation of iin- The projective limit lim((5n, D'^) was introduced in the representation 

theory of ©00 as the space of virtual permutations [IH|, and D'^ was used in [8]. The 
isomorphism of three kinds of projective limits is established by means of the commutative 
diagram 



D'' 



where vr"^ denotes the inverse permutation, and 7r„ denotes the fundamental bijection 
of &n which translates the one-row notation of permutation into the cycle notation of 
another permutation by inserting parentheses ')(' before each proper lower record, e.g. 
(3,2,7,6,1,4,8,5)^= (3)(2, 7, 6)(1, 4, 8, 5) (Stanley [25;, p. 17] gives a slightly different 
version of the mapping). 



5 Specialisations 

Some special values of the parameters ^, C and some limits are worth mentioning. We 
call distribution P on (3°° degenerate if P„(vr„) = for some n and some 7r„ G (5„. All 
distributions P^'^'^^ for 6*, C > are nondegenerate. 
The uniform distribution. The measure 

p(i,i) 

may be called the uniform distribution 
on 6°°, since every Pn'^"* is the uniform distribution on with Pn'^\'Kn) = for 
every 7r„ G &n- The corresponding random order < on N has the characteristic property 
of exchangeability, that is the law of < is invariant under the action of &od- This order 
appears by ranking an iid sample (Xn) from the uniform distribution on [0, 1] (or some 
other contunuous distribution on reals). For fixed n there are also other ways to link 
uniform 7r„ to a sequence of n random reals [12j . 
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Ewens' distributions p(^'^) and P^^''^\ Ewens' distribution on (3„ (also called 9- 
biased permutation, see pLj) is the one which assigns probability 9'^~^/ {6 + l)n-i, to every 
permutation with c cycles. The partition of n comprised of cycle-sizes of 7r„ follows then 
the Ewens sampling formula. 

Suppose C = 1, so the probabilities ^ become 

Pi^'^Vn) = 0'/{e + l)„_i where i + 1 
is the number of lower records of 7r„. When 7r„ follows Pn'^^ then also vr"^, because 
^(^Tn) = ^(tt"^). To see this, draw permutation in two dimensions as a point scatter 
{(j)^ni),j € M}- Observe that the records are those points which do not have other 
points south-west of them. Flip the picture about the diagonal to see that the property 
is preserved. The inversion combined with the ^-mapping in Section H] transforms the 
distribution in its conventional 'cycle form'. Therefore we still call p(^'^) and p(^'^) Ewens' 
distributions (this viewpoint was suggested in [T7]). 

By the same flipping argument, the sequence of lower record times . . . , to 
coincides with the decreasing sequence of lower record values of the inverse permutation 
T^n^i hence under p(^'^) we have further symmetry: (t_£, . . . , to) = (ro, . . . , r_i, r_i). 
Distributions with equal parameters. For = ( there is a symmetry between lower 
and upper records. For distributions Pn'^\7Tn) = 9^^"^ / {29)n+i the minimal sufficient 
statistic is the total number of records I + u + 1. Given the value of this statistic, 7r„ is 
uniformly distributed. 

Bernoulli pyramids p(°^p.°°(i-p)) (0 < p < 1). If C ^ oo but so that O/^O + Q ^p, 
then under the limiting law the probability of vr^ is p^(l — p)" provided £ + m = n — 1, 
and the probability is zero otherwise. Such 7r„ has each 7r„j (j > 1) an upper record with 
probability p and a lower record with probability 1 — p. Only extreme initial ranks are 
possible, i.e ij G Such distributions were exploited in optimal stopping [5]. One 

way to generate such permutation is to split [n] by binomial variable at some integer v, 
then let tti = v for the center and then riffle-shuffle v + 1, . . . ,n and f — 1, . . . , 1 to obtain 
7i2n, • • • , 7r„n. In the cases p = 1 (respectively, p = 0) the distribution concentrates on the 
permutation (ra, . . . , 1) (respectively, (1, . . . , n)). 

Degenerate Ewens' permutations P^^<^\ p(o.C)_ the limiting case ^ — > (but 
C > 0), the permutation has the form vr^ = (l,7r^_j^), where 7r^_^ is a permutation of 

{2, . . . , n} which upon obvious identification has P^-i distribution. In the limiting case 
C — > (but 6 > 0), the permutation has the form 7r„ = (n, 7r^_^), where n'n-i is a 
permutation of [n — 1] which has P^-^i distribution. 

Permutations with only one proper record p(pO'(i-p)o) (0 < J9 < 1). When both 
6,( but so that 6/(6 -\- Q ^ p for some p G [0,1], then the limit law of 7r„ is 
that of (vr„i, 7r„2, 7i'^_2) where (7r„i,7r„2) is either {l,n) or (n, 1) with probability p and 
I — p, respectively, while 7r,^_2 is a uniform permutation of {2, . . . , n — 1} independent of 
(7rni,7r„2). 

Proposition 4. The weak closure of the {9^Q-family is comprised of nondegenerate 
ditributions with 9 > 0, ( > 0, and of three degenerate types described above. 

Proof. This follows by considering P2^'''^ and P^^'''\ □ 
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6 Characterisation of mixtures 



We seek now for a two-parameter generalisation of Theorem 12 (i)], that is we wish to 
characterise the distributions p(^''^) as extreme points of a suitable family of conditionally 
uniform distributions. The following lemma is helpful. 

Lemma 5. Let Qi be the law of an independent 0-1 sequence Bi,B2,... with Bn 
Bernoulli (1/n). Assume Q is a distribution for Bi, B2, . . . with the property that, for each 
n, the conditional law of . . . , B„) given Sn '■= Bi + . . . + Bn and given {Bm, m> n) 
under Q is the same as under Qi. Then Q is a unique mixture of distributions Q^,, 
Tj G [0, 00], under which Bi, B2, . . . are independent with Bn Bernoulli(?7/ {n + rj ~ 1)). 

Proof. This can be concluded from either [211 p. 269] or [TOl Lemma 9]. The key issue is 
that the convergence S'„/logn — 77 holds under Qr^ almost surely. □ 

The first two assertions of the next proposition are equivalent to [12, Theorem 12 (i)] 
and included here for completeness of exposition. 

Proposition 6. Suppose under P the law of Hn for every n = 1,2,... is uniform 
conditionally given the value of a statistic stat. Then the following assertions are true: 

(i) for Stat = ^ distribution P is a unique mixture of P^^'^^ (9 E [0, oo[) and p(i°°'0°°)^ 

(ii) for Stat = u distribution P is a unique mixture of P^^''^^ (C G [0, oo[) and p(o°°'i°°)^ 

(iii) for Stat = £ + u distribution P is a unique mixture of P^^'^^ {6 g]0,cx)[), P( 20-20) 
and p(5°°.5°°)^ 

(iv) for Stat = (£, u) distribution P is a unique mixture of nondegenerate distributions 
P^^'"^) (6',C g]0, oo[), degenerate distributions P(^'°) and p(°'^) (6*, C g]0, oo[), and 
further degenerate distributions p(i-o-oo)^ p(o-o,i-o) p(poo,(i-p)oo) ^ ^j^^ j,^^ 

degenerate distributions do not enter provided P3 > 0. 

Proof. We need to show that the described distributions and only they are extreme. As- 
suming P extreme in the setting of (iv), the tail algebra of the process ((£(7r„), M(7r„)), n = 
1,2, . . .) must be trivial. Let Bn = l(?"n+i G {!,'"' + 1}) be the indicator of some record 
at position n + 1. Under p(^'^) the law of {Bi, B2, . . .) is Q2, hence by Lemma [5] and 
because limS'„/logn is jF-measurable the law of (P„) under P is the same as under 
for some rj. This says that records occur by a Bernoulli process, without specifying the 
types of records. If = the situation is clear: there is only one proper record (for 
n > 1) and p(i o,o-o)^ p(o-o,i-o) g^^.^ ^^le sole possibilities. Suppose rj ^ 0. A key to recognise 
how the records are classified in types is the exchangeability. Let Ik be the indicator of 
the event that the record at {k + l)st record time is a lower record. Conditionally given 
Ji + . . . + Jfe = ^ — 1 all values of the sequence (Ji, . . . , I^) have the same probability 
because by Lemma [T] this is true under p(^'^) and by a simple stopping times 
argument. By de Finetti's theorem, there exists a relative frequency of lower records, 
hence ^(vr„)/(£(7r„) -|- u{7in)) must converge almost surely. But the limit of this ratio is 
jF-measurable hence constant, say p. Appealing again to Lemma [1] we see that (P„) and 
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{Ik) are independent, hence the set of positions of lower records is the one obtained by 
independent thinning with probabihty p of the occurences of I's in (-B„). Thus P = p(^''^) 
with 6 = pi], C = (1 ~ p)v (the instance 77 = oo is included). Part (iii) is shown similarly, 
with the special feature that p = 1/2. □ 



Remark. To put the last result in the framework of [IHIIS], denote Wn{i, u) the probability 
for £ lower and u upper proper records in 7r„. By the rule of addition of probabilities we 
have 

= + + + 1) + (n - wi(0,0) = l, (5) 

which is a recursion dual to ([1]). The set of nonnegative solutions to ([5]) is a convex 

compact set. Proposition [6](iv) describes the set of extreme solutions to ([5]). Interestingly, 

the set of extremes is not closed: each distribution p(pO'(i-p)o) with < p < 1 appears as 

a limit of some nondegenerate P^^^'^^'s, but it is decomposable as a mixture p(pO'(^~p)o) = 
pp(i-o,o-o) _^ _p)p(0 0,10) _ 

A common approach to finding the extreme solutions of ([5]) is based on the analysis of 
asymptotic regimes for I' = i'{n'),u' = u'{n') as n' —>■ 00, which guarantee for all n,i,u 
convergence of the ratios 



n 




n' 




n' / 

I' + l.u' + l 





(6) 



where the numerator is the number of permutations 7r„/ of [n'] with record counts (^', u') 
such that the restriction of 7r„' to [n] has record counts (£, m). Using a monotonicity 
argument, the things can be reversed to show that the convergence ^' / \ogn' and u' / \ogn' 
is necessary and sufficient for the convergence of the ratios (jH]) for all n, m, 



7 Some properties and asymptotics 

As in the case of uniform distribution [20j, asymptotic properties (as n 00) of record 
counts £, u under P^^'"^) follow straightforwardly from the representation via independent 
initial ranks. Thus, both mean and variance of I are asymptotic to ^logn, and that of 
u to C,\ogn. Jointly, (£, u) converge in distribution to independent Gaussian variables. 
The point processes of scaled record times {tk/n : k < 0}, {tk/n : > 0} converge to 
independent Poisson processes with intensities 9dt/t, Qdt/t (for t G [0, 1]), respectively. 

The behaviour of each vr^j under p(^'^) as n varies is that of a process with exchangeable 
0-1 increments, known as Polya's urn model. That is to say, each sequence (vr„j, n > j) is 
a nondecreasing inhomogeneous Markov chain on integers, which starts at some random 
initial rank irjj = ij at time j, and at time n either jumps from some rank 7r„j = f to 
V + 1 with probability {v — 1 + 6)/{n — 2 + 6 + (), or otherwise remains at v. 

The law of rec(7r„) can be expressed in terms of Polya-Eggenberger distributions 



9 



The distribution of the center tq = 7r„i is PE^^'^-*. Conditionally given tq, the lower and 
upper record sequences are independent. The sequence of lower records r_i, . . . ,r_£ is a 
homogeneous decreasing Markov chain on integers which starts at tq and terminates at 1, 
each time descending from the generic r to r — d with probability PEl^'^\d). In a similar 
way, the sequence of upper records ri, . . . ,r„ is a homogeneous increasing Markov chain 
on integers which starts at ro and terminates at n, each time ascending from some r to 
r + d with probability PE';^^^^^{d) . 

Asymptotics of the record values follow from well known properties of Polya urns. 
Recall that beta(a, b) distribution with parameters a > 0, 6 > is the distribution on 
[0, 1] with density - xf-^/B{a,b), where B(a,6) = r(a)r(6)/r(a + b). 

Proposition 7. As n ^ oo, under P^^'*^) the scaled record values of iTn converge, 

— ^ Pk a.s. (k G Z). 

n 

The distribution of po is beta(^,(^). Given po the sequences {pk,k < 0) and {pk,k > 0) 
are independent and representable as 

Pk = roTkTk+i ■ ■ ■ T_i (A; < 0), pfc = 1 - (1 - ro)ZiZ2 ■ ■ ■ Zk (A: > 0), 

where T^'s are beta(6', 1), Z^'s are beta(C, 1) and the variables po, {k < 0) and Z^ 
{k > 0) are all independent. 

Let S be the space of two-sided nondecreasing sequences {xk, k E Z), Xk E [0, 1]. We 
endow S with the product topology of IlfcL-oofO' Padding rec(7r„) by infinitely many 
I's on the left and infinitely many ra's on the right, and scaling by n makes n~^rec(7r,„) a 
random element of S 

n"^rec(7r„) = (..., 1/n, 1/n, r_ejn r_i/n, ro/n, ri/n, . . . , r„/n, 1,1,...). 

Proposition [7| is a strong law of large numbers which says that r?,^^rec(7r„) converge in S 
almost surely to a limiting 'shape' (p^). 

Recall that GEM(^) distribution is the law of the sequence of gaps obtained by break- 
ing [0,1] at atoms of the Poisson point process with intensity 9dx/x (x G [0,1]). The 
decreasing sequence of atoms has the same distribution as the sequence of 'stick-breaking' 
products Di, D1D2, ■ ■ ., with the D/s being iid beta(^, 1). 

The two-sided sequence (p^, G Z) is obtained in a similar way, by splitting [0, 1] at 
Po, and further partitioning the intervals [0, po] and [po, 0] by two independent beta stick- 
breakings with parameters 9 and (. By analogy, the sequence of gaps pk+i — Pk, k E Z, 
may be regarded as a two-sided version of GEM distribution. 

Generalising the classical case of sampling from iid uniforms [24, Proposition 4.11.2], 
the distribution of the bivariate point process of upper (or lower) record values and du- 
rations follows from the spraying property of Poisson processes. Thus, given po the point 
processes {{pk,tk+i — tk), k > 0} and {{pk,tk-i — tk), k < 0}, are independent Poisson, 
with intensity measures (x^~^dx on [po, 1] x N and 9{l — xy~^dx on [0, po] x N, respectively. 
In particular, by the projection property of Poisson processes, given po the conditional 
distribution of the number of pairs of neighbouring lower records < : tk-i — tk = 1} 
is Poisson(6'po) (an equivalent result is shown in [TJ', Corollary 3.1] by computation of 
moments) . 
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8 Generating random permutations 



Under P^^'"^) not only the scaled record values converge (see Proposition [7]), but also 
scaled permutations (7r„j/n, j G N) converge almost surely to some random sequence 
(Xj) E [0,1]°°. In the case of uniform distribution P^^'^\ the sequence (Xj) is just iid 
uniform[0, 1], and (vr^) can be generated by ranking (Xj). Under any P(^'^), (Xj) can be 
produced by a kind of shuffling of the sequences of record values (pfc, k > 0), {pk, k < 0) 
and another independent sequence of uniform variables. Here and henceforth, under 
shuffling of a few sequences we understand a sequence which is comprised of terms of all 
these sequences arranged in such a way that each of the sequences enters in its original 
order. 

Construction 8. Let (Wn) be iid uniform[0, 1], independent of (pfc). We define a new 
sequence (Xn) where some WnS are used, and some are replaced by p^s which will appear 
as upper and lower record values. Start with Xi = pi. Suppose before step n + 1 the 
values p-e, ■ ■ ■ , Pu have been included into Xi, . . . , X„; then pu = max(Xi, . . . , X„) and 
p-e = min(Xi, . . . , X„). At step n + 1 we let Xn+i = Pu+i if 7r„+i > pu, or = p_f_i 
if TTn+i < P-e, or Xn+i = Tin+i othcrwisc. Define a coherent sequence of permutations 
(7r„) by ranking 

It is obvious that, given (pk), the sequence resulting from the construction has the 
same law as iid uniform[0, 1] sequence conditioned on its two-sided sequence of record 
values (see [13] for the one-sided case of upper records). This works for any 9,( because 
conditionally given (p^) the distribution of (vr^) under any p(^'^) is the same as under the 
uniform distribution P^^'^\ 

For every fixed n a similar procedure yields uniform permutation 7r„ conditioned on 
rec(7r„). Start with setting Uni = tq. At each step j > 1 we will have vr^i, . . . , 7r„j_i 
already determined, with some maximum max(7r„i, . . . , TCnj-i) = r^i and some minimum 
min(7r„i, . . . , 7r„ = r_£/. At step j G {2, a value v is chosen uniformly at 

random from [n] \ {uni, ■ ■ ■ , Unj-i}- If f < r.^/ let Unj = t_^/_i, if t> > r„/ let 7r„j = Vu'+i, 
and if r_£/ < v < Vu' let TCnj = v. The sampled value v is replaced each time v breaks the 
last upper or lower record. In n steps the increasing sequences (r_^, . . . , r_i), (ri, . . . , r„) 
are shuffled with other elements of [n]. It is intiutively clear and not hard to show that, as 
n becomes large, n^-'^rec(7r„) = n~^{. . . , 1, r_i, . . . r_i, rg, ri, . . . , r„, n, . . .) will converge in 
S to (pfc). This is just because sampling from large finite sets will have nearly the same 
effect as independent uniform choices from [0, 1]. 

Apparently, from the viewpoint of statistical theory of extremes the sequence (Xn) is 
rather exotic, as it is chosen just to simulate desired behaviour of records. This differs 
general p(^'^) from the uniform distribution P^^'^\ when 'injecting' some extrinsic (p^) 
is not at all necessary since the uniform sample (Wn) supplies automatically appropriate 

record values, so (X„) = (Wn). Still, in the case of integer parameters there is a simpler 
way to produce appropriate (X„) from a sequence of uniforms, as parallels the construction 
of permutations in Proposition [21 

Integer values of the parameters. The idea is to assume some 'prehistorical' sample 
of uniforms. Suppose 6^ > 1, ^ > 1 are integers. For d = 9 + ( — 2 let Vi, . . . ,Vd, Wi, W2, ■ ■ ■ 
be iid uniform[0, 1]. At step 1 choose Xi as the value of rank 9 among Vi, . . . , Vd, Wi. 
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At each step n we will have max(Xi, . . . equal to the {n — 9 + l)th order statis- 
tic in Vl, . . . , Vrf, Wi, . . . , Wn, and min(Xi, . . . , X„) equal to the ^th order statistic in 
Xi, . . . , Xrf, Wi, . . . , Wn- If VTn+i > max(Xi, . . . , X„) we set X„+i equal to the (n+6'— l)th 
order statistic in Vi, . . . , V^, Wi, . . . , W„+i, if Wn+i < min(Xi, . . . , X„) we set X„+i 
equal to the ^th order statistic in Vi, . . . , V^, VTi, . . . , 1V„, VFri+i, and otherwise let X„+i = 
Wn+i- This works, since there are always 9 spacings below min(Xi, . . . ,X„) and ( spac- 
ings above min(Xi, . . . ,X„), thus the resulting ranking is as in the proof of Proposition 

El 

The described process shows that, for integer 6* > 1,C > 1, Proposition [7] is a conse- 
quence of properties of the uniform order statistics. For all other values of 9, ( the result 
can be interpolated from the integer case, because the law of each 7r„ is a rational function 
of the parameters of beta laws for Tk, Zk- 



9 Permutations with the rec statistic 

For arbitrary choice of the distribution for (p^) G S, there is some random sequence 
(X„) resulting from Construction [HI such that given (p^) the law of (X„) is the same as 
for independent uniforms conditioned on the record values. This suggests that arbitrary 
coherent (tt^) with each 7r„ uniform given rec(7r„) can be derived in this manner. In 
general, however, (p^) may have repetitions, therefore we need to be careful with defining 
permutations by ranking. 

We are only interested in the sequences of reals xi,X2, ■ ■ ■ with the property that if 
Xi = Xj for i ^ j then Xj = max(a;i, . . . , Xj) or Xj = min(a;i, . . . , xj). This means that only 
record values can be repeated. We shall define now an order < on N. Suppose first that 
Xi 7^ X2, then we set i<j if either (a): Xi < Xj, or (b): i < j and Xj = Xj = max(xi, . . . ,Xj), 
or (c): j < i and Xj = Xj = min(xi, . . . The rules (b) and (c) are inconsistent if the 
sequence starts with m > 1 repetitions xi = . . . = Xm 7^ Xm+i, in this case all rules apply 
for i,j > m and we just require that each j < m be attributed the initial rank either 1 
or j by some extrinsic rule. For (X^) derived by Construction [81 from arbitrary random 
(pfc) G S and independent uniform (Wj), we define coherent sequence of permutations 
(7r„) by ranking (X„), with account of these rules for repetitions. 

For instance, for constant sequence pk = p, we obtain X„ = p, and = 1 or i„ = n 
according as Wj < p or Wj > p, so this (7r„) is the Bernoulli pyramid p(p°°'(i^p) °°). 
Another example: permutations with single proper record, p(Op.o-(i-p))^ correspond to the 
case when (po,pi) = (0, 1) w.p. p and (p_i,po) = (0, 1) w.p. 1 — p. Conditioning on (p^) 
and on rec(7r„) we have each 7r„ uniformly distributed, whichever the values of (pfc). 

The main result says that this construction is indeed the most general. 

Proposition 9. Let P be a distribution for a coherent sequence of permutations (7r„) 
with the property that, for every n, conditionally given rec(7r„), P„ is a uniform distribu- 
tion. Then rec(7r„) = (r^) satisfies 

— ^ Pk a.s. {k G Z) (7) 
n 
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for some random sequence [pk) with values in S. Conditionally given (pk), the law of 
(Tin) is the same as for the coherent sequence of permutations generated by ranking the 
variables (Xn) determined in Construction [HI 

This is a de Finetti-type representation of (7r„): given (pk), the hmit shape of the 
sufficient statistic ?7,~^rec(7r„), coherent permutations are generated by samphng uniforms 
and shuffling them with (pk)- 

One proof appeals to de Finetti's theorem for 0-1-sequences, and exploits the fact that 
given n is the kth record time tk (so i„ equals 1 or n), the indicator variables l{m <n) 
for m > n are exchangeable, where < is the order on N associated with (7r„). The 
exchangeability implies the existence of limits (I7l). 

Another proof is by reduction to Pitman's characterisation of partially exchangeable 
partitions [22l Theorem 6]. To this end, we need to associate with (7r„) (thought of as 
order < on N) an ordered partition 11 of N in disjoint nonempty blocks {Ak, A; G Z). Let 
Aq := {1} he singleton block. For k > we assign to Ak the kth proper upper record time 
and all integers n <-ordered between the [k — l)st and the kth proper upper record times. 
Similarly, for < we assign to Ak the —kth proper lower record time and all integers 
n <-ordered between the —kth and the {—k + l)st proper lower record times. Thus the 
minimal elements of blocks are the record times {tk, G Z). We order the set of blocks 
{Ak, e Z} by increase of the record values. The sequences (tk, k >0) and (tk, k < 0) 
start with common element to = 1, are increasing and shuffled, that is interlaced in some 
random succession. Conditioning on the succession of record times [tk, k & Z) (which 
could start like e.g. tQ,ti,t2,t_i,t^,t_2, . . .) we obtain a partially exchangeable partition, 
hence [22| Theorem 6] can be applied, from which Proposition [9] follows by unconditioning. 

The differences (pk+i — Pk, k E Z) are the frequencies (pk) of blocks of the ordered 
partition 11. In the event sup < 1 or inf > we have ^ji^g^ pk < I and £ + m ~ 
(1 — J2kezPk)^y i-^- the number of records grows linearly with n. 

10 The boundary of a composition poset 

The classification of coherent permutations with rec statistic fits in the Kerov-Vershik 
framework of potential theory on graded graphs We sketch this aspect of Proposition 

m 

Recall that rec(nn) assumes values in the set of increasing sequences < . . . < ro < 
... < Tu with the first term 1, last term n and a distinguished center tq. By a suitable 
differencing, (r^) can be bijectively encoded into a centered composition of integer n, which 
we define as a sequence of positive integer parts A = (A_^, . . . , A_i, Aq, Ai, . . . , A^) with 
distinguished center Aq = 1 and X]fc=-^ -^fc = n. The connection is established by the 
formulas 

rk = l + X-i + ... + Xk-i {k<0), rk = X-i + ... + Xj {k>0), (8) 
Afc = rfc-rfe_i ik>0), Ao = 1, Xk = rk+i-rk, {k<0). (9) 

The centered composition A corresponding to rec(7r„) is the sequence of block-sizes of 
the ordered partition n„ = Il\[n] from the previous section. 
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The number of centered compositions of n is 2'^~'^{n + 2). The centered compositions 
comprise a graded poset TZ, in which immediate followers of A (centered composition of 
some n) are centered compositions /i obtained by either incrementing one of noncentral 
parts by 1 or by appending 1 to the left or to the right. For instance, (3,1,1,3,2) 
is followed by (1,3,1,1,3,2) , (4,1,1,3,2), (3,2,1,3,2), (3,1,1,4,2), (3,1,1,3,3) and 
(3, 1, 1, 3, 2, 1). The representation of increasing sequences by compositions is convenient 
because passing to a follower requires incrementing only one part. 

The boundary problem for C asks one to find all extreme nonnegative solutions to the 
recursion (/>(A) = Y2f^4'{fJ') with initial condition 0(1) = 1, where the summation is over 
fi which are immediate followers of A. The set of all nonnegative solutions is a compact 
convex set with the property that every its point has a unique representation as convex 
mixture of the extremes (Choquet simplex). 

By some well known general theory each extreme solution appears as a pointwise limit 
0(A) = limm_,oo d{X, fi)/d{fi) for some sequence of centered compositions /i G C of growing 
degree m oo. Here, d{X) is the number of permutations with rec(7r„) = (rj.) and (rj.) 
corresponding to A via ([H]), and d{X, fi) is the number of permutations vr^ which correspond 
to /i and are coherent with some fixed permutation 7r„ having this rec('7r„) = (r^). In 
other words, d{X, fi) is the number of saturated chains in C which interpolate between 
centered compositions A and /i. 

Computing the number of permutations with fixed rec(7rn) yields 

(n - IV 

= -^ / , !\ T- (10) 

where A^ = A^ + Afc+i . . . + A„ for /c > and A^ = A^ + A^-i . . . + X-e for /c < are the 
right and the left tail-sums of A. 

If /i succeeds A in C then /x is of the form /i = {fi^g^b, . . . , fiu+a) (for some a,b > 0) 
and fik > Xk for —i<k<u. For the number of Tr^ coherent with iTn we have 

where are tail-sums of /i. From (fTO!) and (ITT]) 

w X M_,---M_iMi---M„ ^ (/Xfc - l)(A,-iu 
d{X, fi) d{fi) = [[ — — — , 

(m-l)(„_i)| ^^J.^ (Afc-1)! 

where {x)ki = x{x — 1) . . . {x — k + 1) with (x)oj = 1. Analysis of these explicit formulas 
shows that the ratios d{X, fi) / d{fi) converge as m cxd for every A G C if and only if there 
exist limits /ifc/m —>■ for each /c G Z. In such limiting regime for fi with p := (pk), the 
resulting solution is 

-1 u 

0p(A) = n P'^+^pI""' 11(1 - pfc_i)p^-^ (12) 

k=-e k=l 



^Kerov ^16 , Equations 1.4.4, 1.4.4] derived similar one-sided formulas from Stanley's dimension formula 
for coideals in trees. The factors ■:^{^) in [Ml Equation (1.4.4)] should be cor rected as (^C\)- The 
method of [T^ also applies here for a suitable tree and can be used to check pH)) . (fTT|) . 
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where pk = Yl\=-ooPi- Note that for any 7r,„ G ©„ with rec(7r„) = (r^) (corresponding 
to A) we have 0p(A) = Pn{T^n) where P = (P„) is the distribution derived from (pk) by 
Construction [HI From the law of large numbers for this P now follows that each 0p is an 
extreme solution. This again implies Proposition [91 

Finally, we mention one algebraic aspect. For each fixed A G C consider 0,(A) as a 
formal polynomial ([T^ in infinitely many variables {pk, A; G Z). For various A G C these 
polynomials form a basis of an algebra A, which has the property that the structural 
constants of multiplication in this basis are all nonnegative. Moreover, {^k&zPk)(ppW = 
^^(/)p(A), where the sum is over immediate followers p of A. In terms of [19] this means 
that the graded poset C is multiplicative. By the Kerov-Vershik ring theorem (see [HI 
Section 8.7] for detailed proof) extreme solutions have the form 0(A) = x(0,(A)) where 
X is a homomorphism x : ^ ^ R of algebras, which satisfies x {^kezPk) — 1 ^l^o 
satisfies the positivity condition x(0,(A)) > for A G C. Proposition [9] parametrises all 
such X by sequences (pk) G S, so that on the basis (0,(A),A G C) the homorphism is 
the specialisation 0,(A) i— > 0p(A) with p = (pk), where Pk = Pk — Pk-i for A; > and 
Pk = Pk- Pk+i for k<0. 

11 Further examples 

Polya's urns allow to construct a large family of distributions for (7r„) that are analogous 
to Pitman's two-parameter partition structures. The idea is to extend the construction of 
the 'Chinese restaurant process' [23l Section 3.2] by tilting probabilities of extreme ranks 
together with intermediate ranks. 

Let ttfc (A; G Z), 6*, C G M be parameters. Consider distribution P such that given 
ii, . . . ,in the next initial rank satisfies 



e+c+n-i 

r w.p. 8^+°+n-i <r <rk, k <0 

r w.p. ^k-i <r <rk, k> 



(13) 



e+c+n-i 

The principal domain of parameters is defined by the conditions of strict positivity 



l-afc>0 (A; 7^0), 6* + a_i + . . . + > (£ G N), C + "i + •••"«> (m G N). 

Parameter ao can be selected arbitrarily. Under such P the probability of every permu- 
tation Tin G &n with rec(7r„) = (r^) is 

0(A_£, ...,1,...,A„) = 

{6 + a_i){6 + «_i + «_2) ■ ■ - {0 + a_i H h 0!-e)iC + tti)(C + ai + ^2) ■ ■ ■ (C + "i H ^ ««) 



+ 



X 



n-l 



JJ (1 - ak)x,-i , 
k=-e 

where the centered composition A = (A_^, . . . , 1, . . . , A„) encodes (vk) via ([8]), ([9]). For the 
parameters in the principal domain the coherent permutations (7r„) are nondegenerate. 
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The instance = corresponds to the P^^'^^-family. Generahsing Proposition [7] and 
speciahsing Proposition [9] we have the following representation. 

Proposition 10. Suppose P is defined by the conditional distributions f|T3|) . with 
parameters in the principal domain. Then under P the scaled record values of Tin converge, 
as n ^ oo, 

— — > Pfc a.s. (k G Z). 

n 

The distribution of po is beta(^,C). Given po the sequences {pk,k < 0) and {pk,k > 0) 
are independent and representable as 

pk = roTkTk+i ■ ■ ■ T^i {k<0), pk = I - {I - ro)ZiZ2 ■ ■ ■ {k > 0), 

where T^ 's are beta(6'+a_fc+a_A:+i + . • l—ak), Z^ 's are beta(C+ai + . • ■+(y.k, 1— a/c) 

and the variables po, Tk {k < 0) and Zk {k > 0) are all independent. 

Asymptotic properties of 7r„ depend essentially on the parameters. For instance, if 
ttfc = a G ]0, 1[ for all /c > and at = 6 G ]0, 1[ for all A; < 0, then the order of growth of 
the number of upper records is n", and of the number of lower records is ra^, very much in 
line with asymptotics of Pitman's partitions [23| Section 3.3]. Extensions for other values 
of parameters, including those outside the principal domain, seem to be unexplored even 
in the one-sided case of upper records as sufficient statistic (or partially exchangeable 
partitions) . 
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